Building knowledge graphs of regulatory documentation based on semantic modeling and automatic term extraction
Abstract
The paper proposes a new integrated solution for automatic analysis and term identification in regulatory and technical documentation (RTD). Term identification is one of the key tasks in the digitalization of the design and construction of buildings and structures. At present, searching for and verifying RTD requirements is done manually, which leads to a significant number of errors. Automating these tasks will significantly improve the quality of computer-aided design. The developed algorithm is based on natural language analysis methods such as tokenization, lemmatization and stemming, stop-word analysis, word embeddings applied to tokens and phrases, part-of-speech tagging, and syntactic annotation. Experiments on the automatic extraction of terms from regulatory documents show that the proposed algorithm is promising and can be applied to building knowledge graphs in the design domain. For 202 documents selected by experts, the recognition accuracy was 79 % for matching term names and 37 % for matching term identifiers. This result is comparable with known approaches to this problem. The results can be used in computer-aided design systems based on building information modeling (BIM) models, as well as to automate the examination of design documentation.
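As a rough illustration of the first stages named in the abstract (tokenization, stop-word filtering, stemming), the following minimal Python sketch extracts multi-word term candidates from a short text. It is not the paper's algorithm: the stop-word list, the crude plural-stripping `stem` function, the `candidate_terms` helper, and the sample sentences are all hypothetical, and embeddings, part-of-speech tagging, and syntactic annotation are left out entirely.

```python
import re
from collections import Counter

# Hypothetical stop-word list; a real system would use a curated one.
STOP_WORDS = {
    "the", "of", "a", "an", "and", "or", "in", "on", "for", "to", "be",
    "is", "are", "shall", "must", "have", "with", "by", "at", "not",
    "than", "less",
}


def tokenize(phrase):
    """Lowercase a phrase and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", phrase.lower())


def stem(token):
    """Crude plural stripping; a stand-in for a real stemmer or lemmatizer."""
    if len(token) > 3 and token.endswith("s") and not token.endswith("ss"):
        return token[:-1]
    return token


def candidate_terms(text, max_len=3, min_count=2):
    """Count n-grams (2..max_len words) that contain no stop words."""
    counts = Counter()
    # Punctuation separates phrases so that n-grams never cross it.
    for phrase in re.split(r"[.,;:!?()]+", text):
        run = []
        # The trailing None flushes the last run of content words.
        for token in tokenize(phrase) + [None]:
            if token is None or token in STOP_WORDS:
                for n in range(2, max_len + 1):
                    for i in range(len(run) - n + 1):
                        counts[" ".join(run[i:i + n])] += 1
                run = []
            else:
                run.append(stem(token))
    return {term: c for term, c in counts.items() if c >= min_count}


# Made-up sample text, not taken from any regulatory document.
sample = (
    "The load-bearing wall shall be fire resistant. "
    "Load-bearing walls of the building must have a thickness of not less "
    "than 200 mm. The thickness of a load-bearing wall is measured at the base."
)
terms = candidate_terms(sample)
```

Here `terms` maps each repeated candidate phrase to its frequency. In a fuller pipeline, such candidates would then be filtered by part-of-speech patterns and matched against a term dictionary before being linked as nodes of a knowledge graph.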
Articles in current issue
- Designing a side-emitting lens using the composing method
- Laser multiparameter method for incoming inspection of the mounting elements used in the volume of sealed neodymium laser emitters
- Adaptive anti-thermal imaging protection for moving objects
- The parametric convergence performance improvement in the direct adaptive multi-sinusoidal disturbance compensation problem
- The modal sensitivity, robustness and roughness of dynamic systems (review article)
- Numerical simulation of functional characteristics of solar elements InGaAsN/Si
- Sol-gel synthesis of Gd2O3:Nd3+ nanopowders and the study of their luminescent properties
- Detection of a small target object in blurry images affected by affine distortions
- An information system for spatial visualization of prognostic and retrospective data on the probability of observing auroras
- Applying bagging in finding network traffic anomalies
- An analysis of the ways to reduce the vulnerability of networks based on the sequential removal of key elements
- The robust distributed ledger model for a multidimensional blockchain security analysis
- Influence of the temperature factor on the deformation properties of polymer filaments and films
- A one-step optimization method for a compressor wheel of a microturbine engine
- The influence of viscosity and turbulence on the supersonic flow compression and expansion corner
- Modeling the relationship between the hardness and wear resistance of materials during their comparative testing by the “block-on-ring” method
- Application of a short-pulse ultra-wideband probing signal for estimating reflective characteristics